This report analyzes MLB offensive performance indicators from 2021–2025 and projects team and player-level trends through 2028. Using data from Baseball Savant, we identified key metrics — OPS, ISO, OBP, SLG, and exit velocity — that strongly influence offensive success. This report is designed for MLB roster executives who are entering the field and need data-driven insights for player evaluation and offensive strategy.
Data-driven insights help prioritize acquisitions, lineup decisions, and development focus for immediate and long-term impact.
Players: Top performers show consistent OPS and ISO trends; exit velocity correlates moderately with power metrics. Projections highlight potential breakout players and those likely to sustain high performance.
Teams: Linear regression projections of Win%, SLG, OBP, RBI, ISO, and OPS suggest which teams may improve or decline over the next three seasons, helping anticipate competitive dynamics.
Recommendations:
Focus on players with consistently high OPS and ISO to strengthen your lineup.
Monitor emerging talents who show upward trends in exit velocity and isolated power.
At the team level, invest in strategies that improve OBP and SLG to maximize run production.
Our team analyzed MLB batting performance indicators using data from Baseball Savant covering the period from 2021–2025. This analysis focuses on metrics that are predictive of team and player success, such as EV, OPS, ISO, OBP, and SLG.
Decision-maker:
Research Question:
Impact:
Primary Datasets:
MLB team stats 2021-2025.csv
MLB player stats 2021-2025.csv
Data characteristics:
Data Metrics Glossary:
1) Player-level Metrics
| Abbreviation | Full Name / Description |
|---|---|
| last_name | Player’s last name |
| first_name | Player’s first name |
| player_id | MLB’s unique ID for the player, used for data merging |
| year | Season year the stats come from |
| player_age | Player’s age during that season |
| pa | Plate appearances |
| hit | Total hits recorded |
| k_percent | Percentage of plate appearances that end in a strikeout |
| bb_percent | Percentage of plate appearances that end in a walk |
| batting_avg | Batting average: how often the player gets a hit per at-bat |
| slg_percent | Slugging percentage: measures hitting power by counting total bases per at-bat |
| on_base_percent | How often the player reaches base by any method (hit, walk, hit-by-pitch) |
| on_base_plus_slg | OPS: combined measure of getting on base (OBP) and hitting for power (SLG) |
| isolated_power | ISO: measures pure power by counting only extra-base hits |
| babip | Batting average on balls in play: how often a ball put in play becomes a hit (excl. HR & strikeouts) |
| b_rbi | Expected/adjusted RBI metric based on quality of contact |
| xba | Expected batting average based on batted-ball quality |
| xslg | Expected slugging percentage based on batted-ball quality |
| woba | Weighted on-base average: advanced metric measuring total offensive value using run-values for each event |
| xwoba | Expected wOBA based on contact quality |
| xobp | Expected on-base percentage |
| xiso | Expected ISO (expected power output) |
| wobacon | wOBA on contact only (ignores strikeouts and walks) |
| xwobacon | Expected wOBA on contact based on batted-ball data |
| exit_velocity_avg | Average exit velocity (mph) of all balls the player hits |
2) Team-level Metrics
| Abbreviation | Full Name / Description |
|---|---|
| Win% | Team winning percentage over the season |
| SLG | Slugging percentage: measures hitting power by total bases |
| OBP | On-base percentage: how often the team reaches base |
| RBI | Runs batted in: total runs produced by the team |
| ISO | Isolated power: extra-base hitting ability |
| OPS | On-base plus slugging: combined measure of offensive value |
Data Overview:
Top performers in 2025 show consistent OPS and ISO trends.
Exit velocity correlates moderately with slugging metrics.
Team-level projections show potential growth in offensive output for certain teams (expected Win% increase for top 10 teams).
Summary statistics:
1) Player-level stats: Calculated for the main KPIs (Exit Velocity, SLG, Batting Average, OBP, RBI, ISO, and OPS).
| Statistic | Exit Velocity (mph) | SLG | Batting Average | OBP | RBI | ISO | OPS |
|---|---|---|---|---|---|---|---|
| Mean | 89.70 | 0.44 | 0.26 | 0.33 | 74.76 | 0.18 | 0.77 |
| Median | 89.70 | 0.44 | 0.26 | 0.33 | 73.00 | 0.18 | 0.77 |
| StdDev | 2.16 | 0.06 | 0.03 | 0.03 | 19.63 | 0.05 | 0.08 |
| Variance | 4.65 | 0.00 | 0.00 | 0.00 | 385.49 | 0.00 | 0.01 |
| Min | 82.30 | 0.27 | 0.18 | 0.24 | 23.00 | 0.05 | 0.56 |
| Max | 96.20 | 0.70 | 0.35 | 0.46 | 144.00 | 0.38 | 1.16 |
| Range | 13.90 | 0.43 | 0.17 | 0.23 | 121.00 | 0.33 | 0.60 |
| Statistic | Exit Velocity (mph) | SLG | Batting Average | OBP | RBI | ISO | OPS |
|---|---|---|---|---|---|---|---|
| Mean | 89.37 | 0.41 | 0.25 | 0.32 | 67.74 | 0.16 | 0.73 |
| Median | 89.32 | 0.41 | 0.25 | 0.32 | 65.80 | 0.16 | 0.73 |
| StdDev | 2.42 | 0.08 | 0.03 | 0.04 | 22.65 | 0.06 | 0.10 |
| Variance | 5.84 | 0.01 | 0.00 | 0.00 | 513.13 | 0.00 | 0.01 |
| Min | 84.14 | 0.20 | 0.17 | 0.21 | 16.86 | 0.02 | 0.43 |
| Max | 94.87 | 0.57 | 0.32 | 0.40 | 123.62 | 0.30 | 0.96 |
| Range | 10.73 | 0.37 | 0.15 | 0.19 | 106.76 | 0.28 | 0.53 |
2) Team-level stats: Calculated for the main KPIs (Win%, SLG, OBP, RBI, ISO, OPS).
| Statistic | Win Percentage | SLG | OBP | RBI | ISO | OPS |
|---|---|---|---|---|---|---|
| Mean | 0.50 | 0.40 | 0.32 | 686.65 | 0.15 | 0.72 |
| Median | 0.50 | 0.39 | 0.32 | 683.00 | 0.15 | 0.72 |
| StdDev | 0.05 | 0.02 | 0.01 | 32.90 | 0.01 | 0.02 |
| Variance | 0.00 | 0.00 | 0.00 | 1082.32 | 0.00 | 0.00 |
| Min | 0.42 | 0.37 | 0.30 | 630.00 | 0.14 | 0.69 |
| Max | 0.57 | 0.44 | 0.33 | 746.00 | 0.19 | 0.76 |
| Range | 0.16 | 0.06 | 0.03 | 116.00 | 0.05 | 0.07 |
| Statistic | Win Percentage | SLG | OBP | RBI | ISO | OPS |
|---|---|---|---|---|---|---|
| Mean | 0.50 | 0.40 | 0.32 | 686.91 | 0.15 | 0.72 |
| Median | 0.50 | 0.39 | 0.32 | 682.99 | 0.15 | 0.72 |
| StdDev | 0.05 | 0.02 | 0.01 | 34.74 | 0.01 | 0.02 |
| Variance | 0.00 | 0.00 | 0.00 | 1206.55 | 0.00 | 0.00 |
| Min | 0.39 | 0.37 | 0.30 | 626.25 | 0.13 | 0.68 |
| Max | 0.58 | 0.44 | 0.33 | 771.31 | 0.19 | 0.77 |
| Range | 0.19 | 0.07 | 0.03 | 145.07 | 0.06 | 0.09 |
These plots show projected team-level KPIs for the next three seasons of top 10 teams based on linear regression with correlated predictors. Top Teams are Selected Based on 2025 Win%.
Projected Win Percentage (2026–2028)
Description: Projected win percentage for top MLB teams, showing
expected growth/decline in team performance over three years.
Projected SLG (2026–2028)
Description: Shows projected slugging performance of top teams.
Patterns indicate which teams may improve power hitting.
Projected OBP (2026–2028)
Description: Team on-base percentage projection, reflecting plate
discipline and consistency.
Projected RBI (2026–2028)
Description: Estimated run production by team, linked to scoring
potential.
Projected ISO (2026–2028)
Description: Measures team isolated power, highlighting teams likely
to hit extra-base hits.
Projected OPS (2026–2028)
Description: Combined on-base plus slugging metric, giving a broad
measure of offensive efficiency.
These plots show projected player-level KPIs for the top 10 players based on prior performance and correlated metrics.
Projected SLG for Top Players
Description: Expected slugging trends for top players, highlighting
consistency and potential breakout performers.
Projected OPS for Top Players
Description: Combines OBP and SLG for a holistic view of player
offensive output.
Projected ISO for Top Players
Description: Player isolated power trends, indicating extra-base
hitting capability.
Projected Batting Average for Top Players
Description: Tracks expected hit rate per at-bat for top
players.
Projected OBP for Top Players
Description: Player on-base percentage projections, reflecting
consistency and plate discipline.
Projected RBI for Top Players
Description: Expected run production per player, linked to scoring
potential.
Projected Exit Velocity for Top Players
Description: Exit velocity trend predictions, showing higher values
often correlate with power hitting.
These plots show relationships between exit velocity and player hitting stats, and overall correlations among KPIs.
Description: This scatterplot shows a clear positive relationship
between Exit Velocity (EV) and Slugging Percentage (SLG). Players with
higher EV tend to produce more extra-base power. The moderate
correlation (0.62) aligns with the upward trend observed, indicating
that EV is a meaningful but not sole predictor of SLG.
Description: The plot indicates a moderate positive relationship
between EV and Runs Batted In (RBI), with a correlation of 0.55. While
stronger contact contributes to more run production, RBI is also
context-dependent, influenced by lineup position and teammates’
baserunning. The scatterplot trend supports the moderate correlation
seen in the heatmap.
Description: The scatterplot shows a positive trend between EV and
On-base Plus Slugging (OPS), with a correlation of 0.58. EV contributes
to OPS largely through its influence on SLG and power-related outcomes.
The scatterplot confirms a moderate association, highlighting that EV is
relevant but not the only determinant of OPS.
Description: This scatterplot demonstrates a strong positive
relationship between EV and Isolated Power (ISO), with a correlation of
0.65. Players who hit the ball harder tend to generate more power
independent of batting average, confirming EV’s strong link to pure
power metrics.
Players Metrics Heatmap
Description: The heatmap summarizes the relationships among all player offensive metrics. EV shows moderate correlations with power-oriented KPIs (SLG, ISO, OPS, and RBI), reinforcing its role as a useful predictor for future offensive performance. Weakly correlated metrics like batting average and OBP are visually confirmed by the lack of strong trend in their scatterplots.
Team Rankings Heatmap (2021–2025 averages)
Description: This
heatmap ranks teams across key offensive KPIs (SLG, OBP, ISO, OPS, RBI,
and Win%). The color gradient highlights relative performance: red
indicates top-ranked teams, while blue represents the lower-performing
ones. This allows for quick identification of consistently elite
offenses versus teams that underperform across multiple KPIs. It also
helps validate the selection of top-10 teams for forward projections by
revealing their sustained strength across several metrics.
Tableau was used to explore, visualize, and communicate trends in both player- and team-level KPIs. It allows interactive filtering, ranking, and comparison across multiple metrics, making it easier to identify top-performing players and teams, as well as project future performance.
Sample visualization
Full Storyboard View the Full Interactive Tableau Storyboard
Python (Pandas, Matplotlib, Seaborn, Scikit-Learn)
Advantages: Excellent for data cleaning, automation of repetitive tasks, Strong statistical and machine-learning libraries for regression, correlation, and projections.
Challenges: Visualization requires more manual formatting to produce good quality plots. And it Requires careful path and environment management across machines and repos.
RStudio (for Markdown editing and report generation)
Advantages: Provides a clean interface for editing markdown files with real-time rendering, and easy way to embed figures, captions, and formatting for final report production.
Challenges: Figure dimension behavior differs from GitHub’s markdown rendering.
Our analysis combined historical MLB player and team-level statistics with regression-based projections to estimate future offensive performance for 2026–2028. By examining KPIs such as SLG, OBP, OPS, ISO, RBI, Batting Average, and Exit Velocity, we generated insights into which metrics are most indicative of future success.
Key Findings:
OPS and ISO consistently emerge as the strongest indicators of future offensive performance at both the team and player levels. Teams and players maintaining strong OPS trends also show positive projections in Win%, RBI, and SLG.
Exit Velocity has moderate correlation with power metrics (ISO ≈ 0.65, SLG ≈ 0.62), but does not strongly correlate with most KPIs. This means EV is useful, but not a primary predictor in our dataset.
Team Projections (2026–2028):
Player Projections (2026–2028):
Heatmaps and Scatter Plots:
Limitations:
The model only uses batting statistics; defensive and situational factors are not included.
Projections are based on linear regression, which may not capture sudden changes (injuries, role changes, coaching changes, etc.).
Extreme outliers were trimmed (5th–95th percentile) to stabilize results.
Future Work:
Add park effects, defensive WAR, sprint speed, pitch-level statcast features, injury history, and multivariate models.
Use machine learning methods for improved forecasting accuracy.
Hypothetical Decision:
Data-driven Recommendation:
Benefits:
Risks:
Challenges:
Victories:
| Member | Role | Contribution |
|---|---|---|
| Matthew D Gonzalez | Project Lead / Co Head Developer | Data cleansing + Writing code + visualization + findings and write-up |
| Jacob D Lamothe | Code Editor/Checker + Video Editor + Presentation/Narration Lead | Checks code for mistakes/redundancies + statistical validation + Edits video at the end of project |
| Rodolfo Lazaro | Visualization Designer | Tableau plots + checking visualizations |
| Samir Soliman | Head Developer | Import data + write codes + statistical Validation/Model Evaluation + findings and write-up |